CompareDx: a Software Toolkit for Measuring the Performance of Programs that Generate Multiple Diagnoses

Authors

Hamish S. F. Fraser

William J. Long

Abstract

Introduction: Evaluations of medical diagnosis programs have been carried out for several decades, but for programs that produce multiple diagnoses there is a lack of suitable, well-validated performance metrics. If a program reasons about only one (or a few) types of diagnosis, its sensitivity and specificity can readily be determined given a suitable standard diagnosis. However, if the program is designed to reason about dozens or hundreds of possible diagnoses, other metrics may be required. Evaluating such programs usually requires a considerable amount of data per case, so it is difficult to collect more than 100 to 200 cases. This results in sparse data, with many diagnoses appearing only once or twice in the evaluation (and many not appearing at all). Calculating sensitivity and specificity for each diagnosis is therefore impractical, and only common diagnoses can be effectively evaluated. We have refined performance metrics for assessing diagnostic accuracy, evaluated them with data from a clinical evaluation study, and developed a Java program to implement the metrics efficiently.
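The sparsity problem described in the abstract can be illustrated with a minimal sketch. This is not the CompareDx implementation (which the abstract says is written in Java); it is a hypothetical Python example in which each case is a pair of sets (diagnoses the program proposed, diagnoses in the reference standard). The function name `per_diagnosis_stats`, the diagnosis labels, and the four toy cases are all assumptions made for illustration only.

```python
def per_diagnosis_stats(cases, all_diagnoses):
    """Compute sensitivity and specificity separately for each diagnosis.

    cases: list of (predicted_set, reference_set) pairs, one per patient case.
    all_diagnoses: every diagnosis the program can reason about.
    A metric is None when its denominator is zero (undefined).
    """
    stats = {}
    for dx in all_diagnoses:
        tp = fp = fn = tn = 0
        for predicted, reference in cases:
            in_pred = dx in predicted
            in_ref = dx in reference
            if in_pred and in_ref:
                tp += 1
            elif in_pred:
                fp += 1
            elif in_ref:
                fn += 1
            else:
                tn += 1
        stats[dx] = {
            "n_positive": tp + fn,  # cases where dx is truly present
            "sensitivity": tp / (tp + fn) if (tp + fn) else None,
            "specificity": tn / (tn + fp) if (tn + fp) else None,
        }
    return stats


# Hypothetical toy data: four cases, four possible diagnoses.
cases = [
    ({"heart failure", "aortic stenosis"}, {"heart failure"}),
    ({"heart failure"}, {"heart failure", "mitral regurgitation"}),
    ({"mitral regurgitation"}, {"mitral regurgitation"}),
    ({"heart failure"}, {"aortic stenosis"}),
]
diagnoses = ["heart failure", "aortic stenosis",
             "mitral regurgitation", "pericarditis"]

stats = per_diagnosis_stats(cases, diagnoses)
for dx in diagnoses:
    print(dx, stats[dx])
```

In this toy data "pericarditis" never appears in the reference standard, so its sensitivity is undefined, and "aortic stenosis" appears only once, so its sensitivity rests on a single case. With dozens or hundreds of possible diagnoses and only 100 to 200 cases, most diagnoses would fall into one of these two situations.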

Similar resources

New approaches to measuring the performance of programs that generate differential diagnoses using ROC curves and other metrics

INTRODUCTION Evaluation of computer programs which generate multiple diagnoses can be hampered by a lack of effective, well recognized performance metrics. We have developed a method to calculate mean sensitivity and specificity for multiple diagnoses and generate ROC curves. METHODS Data came from a clinical evaluation of the Heart Disease Program (HDP). Sensitivity, specificity, positive an...

A Two-stage DEA Model Considering Shared Inputs, Free Intermediate Measures and Undesirable Outputs

Data envelopment analysis (DEA) has proven to be an excellent approach for measuring the performance of decision-making units (DMUs) that use multiple inputs to generate multiple outputs. However, the allocation of shared inputs and undesirable outputs has received little attention in this line of work. This paper proposes a two-stage DEA model considering simultaneously the structure of shared...

The semi-automatic parallelisation of scientific application codes using a computer aided parallelisation toolkit

The shared-memory programming model is a very effective way to achieve parallelism on shared-memory parallel computers. Historically, the lack of a programming standard for using directives and the rather limited performance due to scalability have affected the take-up of this programming model. Significant progress has been made in hardware and software technologies; as a result the p...

Measuring Iran's Industrial Clusters Competitiveness: Through Porterian Diamond Framework and Factors

Competitiveness is divided into two broad categories: macroeconomic and microeconomic competitiveness. Microeconomic competitiveness indicators have a direct impact on company productivity, within which the state of cluster development is the most important pillar of microeconomic competitiveness, as introduced by Michael Porter and his framework, the Diamond Model. The Diamond reveals al...

Measuring the Performance of the Virtual Teams in Global Software Development Projects

Development teams that are geographically spread, culturally mixed, and mainly dependent on information and communication technology (ICT) for communication are defined as global virtual teams (GVTs). Despite the advancement of technologies, achieving efficient performance in GVTs remains a challenge. The reviewed literature has highlighted the importance of training and development, organi...

Publication date: 1999
